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ABSTRACT 

Recent ISO-data, has allowed for the first time observationally based estimates for 
source confusion in mid-infrared surveys. We use the extragalactic source counts from 
ISOCAM in conjunction with if-band counts to predict the confusion due to galaxies 
in deep mid-IR observations. We specifically concentrate on the near-future Space 
Infrared Telescope Facility {SIRTF) mission, and calculate expected confusion for the 
Infrared Array Camera (IRAC) onboard SIRTF. A defining scientific goal of the IRAC 
instrument will be the study of high redshift galaxies using a deep, confusion limited 
wide field survey at 3-10 |im. A deep survey can reach 3 |J.Jy sources with reasonable 
confidence in the shorter wavelength IRAC bands. Truly confusion limited images with 
the 8|i.m will be difficult to obtain due to practical time constraints, unless infrared 
galaxies exhibit very strong evolution beyond the deepest current observations. We find 
L* galaxies to be detectable to z=3-3.5 at 8 \xm, which is slightly more pessimistic 
than found by Simpson & Eisenhardt (1999). 

Key words: infrared: galaxies - cosmology: observations - galaxies: evolution - 
galaxies: photometry - methods: observational - techniques: photometric 



1 INTRODUCTION 

One of the successes of the Infrared Space Observatory [ISO) 
mission have been the new extragalactic source counts in 
the mid-infrared, extracted from both deep (Oliver et al. 
1997, Elbaz et al. 1999) and wide field (Serjeant et al. 2000) 
surveys. These counts are important in planning effective 
observing strategies for the next generation of space-borne 
IR missions, the SIRTF (Fanson et al. 1998) and ASTRO-F 
(Pearson et al. 2000), particularly with respect to expected 
source densities and confusion. 

Confusion is usually defined as the fluctuations of the 
background sky brightness below which sources cannot be 
detected individually - these fluctuations are caused by in- 
trinsically discrete extragalactic sources (we do not consider 
confusion due to Galactic cirrus in this work) . Thus the fluc- 
tuations, or noise, come from the same sources one wishes 
to study. 

Confusion sets an important and fundamental limit to 
astronomical observations. For a given wavelength and reso- 
lution, it will not be productive to extend the exposure time 
indefinitely since a flux density limit will be reached beyond 
which no additional discrete sources can be extracted. This 
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is the limit usually referred to as the confusion limit. Confu- 
sion cannot be avoided even with arbitrarily high resolution 
because of the finite sizes of sources on the sky. 

Just as new sub-mm data allowed Blain, Ivison & Small 
(1998) to set observational limits on source confusion at 
those wavelengths, we are now able to estimate more ac- 
curately the confusion at mid-infrared wavelengths using 
deep galaxy counts from ISO data. Confusion in mid-IR has 
been discussed in connection with the individual deep sur- 
veys (e.g. Oliver et al. 1997 for the ISO HDF survey). 

A useful theoretical definition of confusion was devel- 
oped by Scheuer (1957) and followed up in numerous other 
works. More recently, essentially the same formalism has 
been used to predict confusion in observations done at ra- 
dio (e.g. Franceschini 1989, WaU et al. 1982, Condon 1974), 
sub-mm (Toffolatti et al. 1998), and X-ray (Barcons 1992, 
Scheuer 1974) wavelengths. It turns out that sources at the 
fiux density level where the surface density is approximately 
1 source per beam produce the bulk of the confusion noise. 
This is easy to understand: at higher surface densities the 
multitude of sources per beam suppresses the fluctuations 
(contributing more to the absolute background level) , while 
at lower surface density levels the many 'empty' beams also 
decrease the fluctuations. 

However, confusion is not a very straightforward limit 
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since it is a function of wavelength, resolution, and other in- 
strument response dependent factors. Moreover, calculations 
of confusion limits for a given instrument are highly source 
count -model dependent. 'Confusion' is also sometimes con- 
fused with source detection probabilities, i.e. completeness 
level estimation when performing source extraction and pho- 
tometry. Bright sources are often neglected from confusion 
calculations, but these bright objects can be a major factor 
in lowering the completeness of detections in a given sur- 
vey. The aspect of completeness vs. confusion is perhaps the 
least-studied in the otherwise extensive literature on confu- 
sion. 

The goal of this paper is two-fold: i) to study the re- 
lation of confusion noise to actual detection of sources, and 
thus arrive at a practical definition for confusion, and ii) 
to predict this confusion limit in mid-IR specifically for the 
forth-coming space infrared mission, the SIRTF. 

We start from analytical and theoretical confusion de- 
termination and proceed to a more observer oriented com- 
pleteness analysis. In particular, we will define confusion 
'noise' and confusion 'limit' and investigate their relation to 
completeness levels in source extraction. We specify when 
the different methods of deriving confusion are most appro- 
priate and present both methods for IRAC. The relevant 
quantitative numbers for confusion with IRAC/ SIRTF are 
presented in Section |^ and Table 1. 



2 ANALYTICAL SOLUTION FOR 
CONFUSION NOISE 

Given the source-count distribution and the response pat- 
tern of the detector, we can estimate the confusion analyti- 
cally - this has been done by many authors (see above) , here 
we mainly follow the notation of Franceschini et al. (1989, 
1991). 

Let N{S) be the differential distribution of sources of 
flux S {N{S) is used for differential counts throughout this 
paper instead of dN/dS). We assume that the sources are 
distributed randomly on the sky. Now let us include the 
information of the observing instrument: let f{6,ip) be the 
point-spread profile (normalized to unity) of the detector. 
Then the amplitude of the response at location [6, tp) from 
the beam axis, is 



(1) 



The mean number of responses, R{x), with amplitudes be- 
tween X and X -I- da; in a solid angle dfi is: 



R{x)dx - 



N 



dO, 



dx. 



(2) 



where we integrate over the whole beam ^b- 

The intrinsic fluctuations of R{x) can be written simply 
as the second moment aconf of the R(x) distribution: 



2 

^conf 



X R{x)dx. 



(3) 



The lower integration limit is because S is always pos- 
itive. The upper limit is not so straightforward; the variance 
would diverge if high-flux truncations are not introduced in 
either the source counts N{S) or the response distribution 
R{x). The limit is often set to Xc ~ Qcr, with Q in the range 



of 3-5. This kind of upper limit makes the calculation an it- 
erative process. The logic of having a response cut-off rather 
than a cut-off at specific flux S, is that the latter would elim- 
inate the contribution by bright sources alltogether whereas 
in reality their presence - even relatively far away from the 
beam center - could significantly affect the measurement 
due to the wings of their PSFs. 

Notice that if Q = 3, for example, it does not follow that 
the objects detected will be 'Sa'-detections in the conven- 
tional sense. The Q-value merely tells how large responses 
are included in the calculation of the confusion: this cor- 
responds to setting a limit below which a response would 
not be detected with sufficient confidence as such. Thus for 
Poissonian and Gaussian distributions Q = 3 corresponds to 
a probability 0.3 per cent that a response x results from 
a large fluctuation of the background. The (Tcont value from 
Eq. w must be multiplied by 3 to get the '3a confusion limit' 
for the detection of objects regardless of the Q used. 

For simplicity we will assume a Gaussian beam pattern 



/(*) = exp(4*ln2). 



(4) 



where ^' — {6/6o)^ ■ Performing the integration over the solid 
angle fit, Eq. ^can be written as 

f^coniiXc) = TTell{Xc), (5) 

where 



/(*) 



x^dx I N i -J^ 1 exp(4*ln2)d*. (6) 



10 Jo 

Because of the integration limit [xc ~ Qu) it is easier to 
compute Oo (the FWHM of the beam) as a function of a, 
i.e. 



9o = a/{nl(x,))'^' 

and then determine the confusion for a given 6^0. 

If the source counts have a power-law form, N{S) 
kS^ , Eq. ^ can be integrated to give 

^ coni — I, „ ) -^c 

3-7 



(7) 



(8) 



where Q,e — ^ fi^V ^dO, is the effective beam size 
(e.g. Condon 1974, Hacking & Houck 1987). If the Gaus- 
sian beam is assumed, as above, we can find a relation 



„ /j2/(7-l) 
l^conf OC 



(9) 



in the absence of cut-offs Sc in source fluxes. 

In summary, it is evident that confusion will be a func- 
tion of the beam size, the source counts, cut-offs related to 
the source and response distribution, and the Q-term: 



O-conf = O^onf{d0,'N{S),Xc,Sc,Q) 



3 THE IRAC INSTRUMENT 



(10) 



The Infrared Array Camera, (Fazio et al. 1998), is one of 
the three focal plane instruments on board SIRTF. IRAC 
is a four-channel camera that provides simultaneous images 
at 3.6, 4.5, 5.8, and 8 ^m. Two adjacent 5.12' x 5.12' fields 
of view are seen by these four channels in pairs. All four 
detectors are 256 x 256 pixels, with 1.2"/pixel scale. The 
two short wavelength channels use InSb detectors while the 
two longer wavelengths have Si:As IBC detectors. 
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In the following, we concentrate on the shortest and 
longest wavebands, the 3.6 and 8 [im channels. The sensitiv- 
ities in these two bands and characteristics of galaxy models, 
as well as confusion estimates bracket those expected from 
the two intermediate channels. The expected 5cr sensitivities 
in 200 s for the 3.6 and 8 [im bands are 3.1 and 23.2 \iJy, re- 
spectively, for point sources, and 0.9 and 5.1 ^Jy/arcsec^ for 
extended sources. The bandwidths are 21 and 38 per cent, 
respectively. 

The major scientific objectives for IRAC and SIRTF 
are i) to study the early universe; ii) to study ultraluminous 
galaxies and active galactic nuclei; iii) to search for and 
study brown dwarfs and superplanets, and iv) to discover 
and study protoplanetary and planetary debris disks. These 
objectives are also mirrored in the six large 'SIRTF Legacy 
Science Programs', which will utilize nearly 3200 hours of 
SIRTF observing time primarily during the first year of the 
mission. The projects are: 'The SIRTF Galactic Plane Sur- 
vey'; GOODS: Great Observatories Origins Deep Survey'; 
'From Molecular Clouds to Planets'; 'SINGS: The SIRTF 
Nearby Galaxies Survey - Physics of the Star-Forming ISM 
and Galaxy Evolution'; 'SWIRE: The SIRTF Wide-area In- 
frared Extragalactic Survey'; and 'The Formation and Evo- 
lution of Planetary Systems: Placing Our Solar System in 
Context'. All of these are expected to result in substantial 
databases that will be invaluable for archival research, and 
in planning subsequent programs on SIRTF and on other 
space-borne, airborne, and ground-based observatories. 

While the motivation of this paper is connected to deep 
surveys of the high redshift universe (e.g. GOODS), con- 
fusion limit due to galaxies is a factor also in the deeper 
brown dwarf searches. Additionally, confusion due to stars 
becomes important in these surveys when operating close 
to the Galactic plane, at 6 < 20° (e.g. the Galactic Plane 
Survey project) . 
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Figure 1. Simulated IRAC 3.6 \xm band counts. The crosses show 
actual /C-band galaxy counts shifted to the shortest wavelength 
IRAC band. The model used in the transformation includes pure 
luminosity evolution, uses Gardner et al. (1996) ftT-band local lu- 
minosity function, and has open cosmology. The calculated counts 
of the same model are plotted as the thick solid line. The low- 
est dashed line shows counts from the same model but with flat 
go = 0.5 cosmology, whereas the long-dashed curve uses flat, cos- 
mological constant dominated cosmology (Aq = 0.8). The dash- 
dot line is calculated using an alternative LF, that of Szokoly et 
al. (1998), which results in a slightly higher normalization at the 
bright end. 



4 SOURCE COUNT MODELS 
4.1 Baseline model 

We explore the effects of confusion and completeness with 
simple models of mid-infrared galaxy counts. As a first step, 
observed near-infrared if-band source counts are extrapo- 
lated to mid-IR. 

To predict the counts at another wavelength, the 
specifics of the observing filter are needed, as well as the 
shape of the spectral energy distribution (SED) of the con- 
tributing sources at different redshifts and the numbers of 
these sources. The idea is to construct a model which fits 
reasonably well the observed counts, and then calculate the 
counts of the target band with the same model. Thus, a 
consistent extrapolation requires knowledge of the SEDs of 
different galaxy types, the local galaxy luminosity functions 
(LF) of the types, possible evolution of the sources, and 
a cosmological model. The observed counts themselves can 
then be shifted to the other band using information of the 
predicted colors as a function of magnitude. 

In the following the baseline model is an extrapola- 
tion, such as described above, constructed to fit observed 
if -band counts. To compute the galaxy counts we used 
the general formalism presented earlier in Vaisanen (1996). 



For the SEDs and luminosity evolution of galaxies we use 
the 'bc96' models (Bruzual & Chariot 1993) as presented 
in Gardner (1998). We also made use of ncmod, a useful 
general purpose galaxy number count model presented in 
Gardner (1998). We adopt a prescription for internal ab- 
sorption by dust in galaxies from therein as well as the in- 
cluded six different galaxy types (passively evolving E/SO, 
Sab, Sbc, Scd, Irr, and a small population of constantly star- 
forming dwarfs) . The galaxy mix is from Yoshii & Takahara 
(1988) and the /f-band local luminosity function (same for 
all galaxy types) from Gardner et al. (1997). For cosmology 
we used Ho — 50kms~^Mpc~^ and go ~ 0.02. 

Using the model above, we shift the observed Jf-band 
counts, a compilation from Vaisanen et al. 2000, to 3.6 [im 
- these are shown as crosses in Fig. |^. The predicted source 
count of the same model is overplotted as the thick solid line. 
This same baseline model is calculated also for 6.7 and 8\im 
bands. The baseline model corresponds to the one in Gard- 
ner's (1998) Figure 11. There are other models overplotted in 
Fig. 0, corresponding to different cosmologies. The dash-dot 
curve uses the local near-IR luminosity function of galaxies 
from Szokoly et al. (1998), which we have found to predict 
near-IR counts fitting well our own wide-field galaxy counts 
(Vaisanen et al. 2000). 
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4.2 /SO-data based model 

For the 5.8 and 8 \im predictions, the mid-IR spectral prop- 
erties of galaxies become important and extrapolations such 
as used for the baseline model are not trustworthy. This is 
because the 'bc96' SEDs used above do not include the broad 
emission features in the spectra of galaxies between 3 and 
12 |i.m, commonly thought to be the signature of Polycyclic 
Aromatic Hydrocarbons (PAH). While global properties of 
galaxy populations at mid-IR remain yet uncertain, recent 
ISO data certainly has shown that PAH features are an im- 
portant factor which cannot be ignored (eg. Mattila, Lehti- 
nen & Lemke 1999, Genzel & Cesarsky 2000). This is partic- 
ularly true for any dusty and infrared bright objects, which 
might well dominate counts at mid-IR. Modelling of the rel- 
evant dust and re-radiation processes is complicated (see 
eg. Guiderdoni et al. 1998, Silva et al. 1998, Xu et al. 1998); 
therefore, to be as observationally based as possible, our cho- 
sen recipe for predicting counts for the longer IRAC bands, 
is to take the newly available mid-IR extragalactic counts at 
6.7 \im and shift these to the neighbouring IRAC bands. 

We thus fitted an experimental source count slope to the 
6.7 \im extragalactic counts found by various ISOCAM sur- 
veys: ELAIS (Serjeant et al. 2000), ISO-HDF project (Oliver 
et al. 1997), a survey at the Lockman Hole (Taniguchi 
et al. 1997), and the CFRS (Flores et al. 1999). The faint 
slope beyond the deepest observations is adjusted to typi- 
cal shape of existing (evolutionary) models (Franceschini et 
al. 1997, Pearson & Rowan- Robinson 1996, Roche & Bales 
1999). Preliminary ISO-counts from the HDF-S field are also 
in excellent agreement with this experimental source count 
model (Oliver et al. 2001, in preparation). Though various 
independent counts are in very good agreement, it should 
be noted that there still are uncertainties in the calibration 
of the ISOCAM data (see eg. Serjeant et al. 2000, Aussel et 
al. 1999). Note also that the ELAIS counts are not corrected 
for a possible Eddington/Malmquist type bias due to a non- 
Gaussian flux-error distribution (see Figure 1. in Serjeant et 
al. 2000). 

Fig. 1^ shows the recent mid-IR counts with our fitted 
model overplotted as the thick solid curve and the equivalent 
of the 3.6 \i.m baseline model as the lower thin solid line. We 
then translated both the 6.7 \im counts and the fitted model 
to the 8 |J.m band (Fig. ^). 

Naturally the translation to the 8 \im band is model de- 
pendent, but since the shift is not large (the ISOCAM filter 
partially overlaps the IRAC 8 \uxi filter), the uncertainty is 
low, even given the uncertain mid-IR spectral features. We 
used the baseline model in the transformation. We computed 
numerous model counts at 6.7 and 8 (xm, with different LFs, 
galaxy mixes, cosmologies, SEDs (with and without PAH 
features), and found that all these changes in the trans- 
formation of counts are well within the error bars of the 
observed counts. 



4.3 Reliability of models 

As is seen from Fig. ^ the observations do not constrain 
the models fainter than about 20 ^.Jy, so the faint counts 
therefore remain model-based. However, the baseline model 
provides a realistic slope prediction since it is derived from 
near-IR counts, which extend two decades fainter. Further- 
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Figure 2. Observed differential 6.7 |am ISOCAM counts: wide- 
field ELAIS counts (Serjeant et al. 2000; squares), deep HDF-N 
(Oliver et al. 1997; asterisks), Lockman Hole (Taniguchi et al. 
1997; triangles), and CFRS data (Flores et al. 1999; diamonds). 
The thick curve is a fit to these ISOCAM data. The thin solid line 
shows the prediction of the same baseline model as used in the 
previous figure. The dashed curve is the model from Franceschini 
et al. (1997). 



more, the faint counts result mostly from high-z objects seen 
at their redshifted near-IR rest frames, which are well mod- 
elled by the adopted SEDs. 

We briefly compare some 6.7 ^im model counts with the 
models used here. We have plotted the model of Franceschini 
et al. (1997) in Fig. ^for reference. Models of Franceschini et 
al. (1991) are similar to the baseline curve here. Pearson & 
Rowan- Robinson (1996) models match our thick solid curve 
very well at and beyond the break point. The evolving model 
of Roche & Eales (1999) is very close to our experimental 
ISO-fitted curve. Overall, the faint mid-IR counts at 6.7 ^im 
(and especially at 15 \im) are surprisingly high and have a 
steep slope. 

At brighter flux levels, the 6.7 (xm counts fit well the 
models of Pearson & Rowan- Robinson (1996), as well as 
those of Roche & Eales (1999), while they are in clear ex- 
cess of models of e.g. Franceschini (1997) and the baseline 
prediction of our near-IR extrapolated model. 

Where does this large difference at the mjy level num- 
ber count regime come from? One explanation is the effect of 
PAH features. As a test, we calculated the baseline model in 
an alternate way, substituting all the spiral galaxy SEDs be- 
yond 1 fi.m with mid-IR spectra taken from GRASIL results 
(Silva et al. 1998) , which include the broad mid-IR spectral 
features. The effect at 1 mJy is substantial: the 6.7 [i-m band 
there is almost a factor of 10 increase in counts, raising the 
baseline model just to the level of our experimental counts. 



© 2000 RAS, MNRAS 000, §-13 



Confusion limit due to galaxies: using SIRTF 's Infrared Array Camera 5 




-3 -4 -5 -6 

log FLUX [Jy] 



Figure 3. Simulated 8 IRAC counts. The data points are 
those from Fig. |2| shifted to the 8 yun band. The thick soUd Une is 
translated from the /50-data fitted curve of the previous figure, 
and the thin solid line is the baseline model. The dashed and dot- 
ted lines show ad hoc shapes for the faint end of the counts, used 
to examine the effect on confusion limit and the depth reached 
with the IRAC instrument (Section 5.3). 
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Figure 4. Confusion in the 3.6 pm band as a function of FWHM 
of a Gaussian beam. The solid line is the calculation based on 
the baseline model described in the text. The short dashed and 
long-dash lines show results for the same galaxy population and 
evolution model, but with different cosmologies (instead of an 
open model, they use a matter dominated and A-dominated flat 
cosmologies, respectively). The dash-dot curve shows confusion 
using an alternative LP (Szokoly et al. 1998). The dotted lines 
show the effect of the Q-parameter in calculation of (Tconf; the 
upper has Q = 5 and the lower Q = 2, while Q = 3 is used for 
the other curves and elsewhere in the paper. 



At 8 nm the increase is a factor of 30, which takes the base- 
line curve slightly higher than the ISO-fitted experimental 
curve. By 10 nJy the counts with and without PAH features 
are within a factor of two again. Whether it is appropriate 
to use such SEDs for the whole population of disk galaxies 
is uncertain. Nevertheless, it is clear that the experimental 
ISO-fitted counts used here are by no means unrealistic, and, 
moreover, the PAH features in mid-IR spectra are a crucial 
factor in explaining mid-IR counts. 



5 DERIVING CONFUSION 
5.1 Confusion vs. beam size 

Equation ^ is used to compute confusion as a function of 
FWHM of the beam size (0o)- This is equivalent to con- 
fusion limit curves in papers of Franceschini et al. (1989, 
1991). Several different cases are plotted in Figs. ^ and ^ 
corresponding to source counts in Figs. |l] and ^. 

Figure ^ shows effects in the 3.6 ^m IRAC filter confu- 
sion due to typical changes in cosmology and of parameters 
in the cTconf calculation. The confusion noise (Jconf for the 
baseline model is calculated using (3 = 3 (see eq. |^ and dis- 
cussion following it) and plotted as the solid line. Confusion 
from the same model is calculated also using Q = 2 and 5; 
these are plotted as dotted lines, below and above the first 
curve, respectively. The first model is then modified to flat 
cosmology (go = 0.5, dashed line; it is unrealistic, but is 
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Figure 5. Confusion as function of FWHM in the 8 pm band. The 
source counts which result in these confusion curves are Fig. ^ 
The thick solid curve corresponds to counts fitted to the 6.7 pm 
ISOCAM counts and shifted to this band. The dashed and dot- 
ted lines show results from ad hoc models of fai nt co unts, not 
constrained by present observational data (Section 5.3). 
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included to compare with some earlier determinations) and 
to flat cosmology with a cosmological constant (Ao = 0.8; 
long-dash). Finally, the dash-dot curve corresponds to the 
same alternative local LF as used in Fig. |^. 

Depending on which model one adopts, YRAC/ SIRTFs 
1(7 confusion noise from the realistic models at 3.6 \im seems 
to be 0.2 - 0.5 ^Jy for the IRAC resolution of ^ 2.3". It is 
noteworthy, that changing the Q value changes the confu- 
sion as much as extreme changes in cosmology. It is thus 
important to check which Q value was used when directly 
adopting some confusion calculation for a given mission (eg. 
Toffolatti et al. 1998 use (5 = 5 when predicting confusion 
in far-IR and sub-mm). 

Figure ^ shows various models for the IRAC 8^m fil- 
ter. Next we investigate changes in confusion due to differ- 
ent galaxy population models. The solid curve shows the 
adopted baseline model, equivalent to the baseline 3.6 jim 
model. The corresponding source count can be found in 
Fig. 1^. The baseline model is close to the confusion pre- 
diction of Franceschini et al. (1991), which has been used 
during the S I RT F plamimg phases. The resulting ISO-fitted 
confusion curve is plotted as the thick solid line. The ex- 
pected la confusion noise is (Tconf ~ 0.6(i.Jy. 

It turns out, that the flux levels contributing most to the 
confusion of the longer wavelength IRAC-filter are just be- 
yond the sensitivity limit of deepest ISOCAM observations. 
Thus the exact amplitude and location of the 'bump' seen in 
the mid-IR counts essentially determines the expec ted I RAC 
confusion limit. We return to this issue in Section 5.3 



^ -6 



cn 
o 



3.6yu.m 



□ 5'" 



confusion 'NOISE' 

2° conf 'LIMITS' 
1 b/s 

LIMIT of 70% 
completeness 



4 6 
flo [orcsec] 



10 



Figure 6. Comparison of confusion noise and confusion limit 
at 3.6 \im. The solid line is the same as the baseline confusion 
noise model in Fig. Q The Itr confusion limit calculated from 
one-beam-per-source analysis is shown as the dotted line, and 
the corresponding curve for 20 beams/source as the dashed ine . 
Photometric simulations using an actual image (see Section 6.1) 
constructed from the same baseline model yield 70 per cent com- 
pleteness limits (as a function of the FWHM of a Gaussian PSF) 
which are shown as squares. It is seen that realistic limits to source 
detection are close to the 20 beams/source limit. 



5.2 Confusion limit from surface densities 



Another often used method of defining confusion is to cal- 
culate the flux density where the surface density of objects 
per beam becomes unity. This is done for the same models 
as above, using Ue// = 7r(So/2)^/ ln2 as the approximate 
effective beam area (e.g. Condon 1974). Sets of results for 
both wavebands are shown in Figs. ||and@. 

The confusion determination from the surface density 
will be called the confusion limit (dashed and dotted lines). 
In contrast, we will refer to the result of confusion calcu- 
lation discussed in the previous section ((Tconf, Eq. P), as 
confusion noise (solid lines). Both would be called 'Icr' con- 
fusion in the literature (e.g. Blain 1998 et al.), while it is 
clear that there is a difference in the value of inferred con- 
fusion. The noise is a factor of 3 to 5 higher than the limit, 
the difference being larger at smaller FWHM's. It is unreal- 
istic to adopt the 1-beam-per-source 'rule of the thumb' as 
an estimate of confusion. Indeed, often the confusion limit 
is set to some beams per source, where N is in the range 
20-50. 

Although both of the above confusion limit analyses are 
often used, they are at opposite ends with respect to their 
approach. The analytical calculation presented first deter- 
mines the fluctuations caused by unresolved sources. The 
second method is based on the realization that the bulk 
of fluctuations resulting from these unresolved sources are 
caused by sources which have a source density of about one 
per beam. Using any given (cumulative) source count law, 
one can calculate where that level is, and assign it as the 
confusion limit. This latter method thus ignores the unre- 
solved sources, and instead starts out by calculating num- 
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Figure 7. Same as previous figure, but for the 8 \xm band. The 
effect of steepness in the 8 (im count slope is seen in the confu- 
sion curve: it is flatter over the range of resolution shown here 
compared to the 3.6 \ini confusion curve as predicted by Eq. ^ 
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bers of bright sources. It is thus more related to completeness 
analysis in counting of individual galaxies - hence the term 
confusion limit used here. Additionally, it is difficult to as- 
sociate a meaningful noise figure, one which could be added 
to other noise sources, to this confusion limit; it does not 
add in quadrature like random noise. 



5.3 Sensitivity of confusion to models 

As seen earlier, the 6.7 [im model is not constrained by ob- 
servations below ~ 20 \i3y. While the faint extrapolation is 
based on near-IR data and realistic models, it is worth-while 
to investigate how sensitive the confusion noise prediction is 
to the model-dependent deep counts in the 8 ^.m band. Con- 
sider four ad hoc number count slopes beyond the ISO-daia. 
regime. The dashed curves in Fig. ^ show models where the 
amplitude of the bump at the break point of the counts 
(at ~ lO^iJy) is either higher or lower than expected from 
models fitted to ISOCAM data. The dotted curves assume 
that the break is determined correctly, but the faint slope is 
shallower or steeper than the one adopted previously. 

Fig. ^ shows the calculated o"conf for each of these source 
counts. It is obvious that the faint end slope (beyond « 6 
jijy) does not significantly affect the confusion level. The 
amplitude of the bump, on the other hand, is the most im- 
portant factor contributing to confusion noise. 

The dependence of confusion on the 'bump' is expected. 
The 8 [im sources start to approach beam-per-source surface 
densities with IRAC resolution at about the same region 
where the bump, or break, in the counts is. In addition, 
with steep counts, the sources at the cut-off (or roll-over) 
level are the population contributing most to confusion (eg. 
Toffolatti et al. 1998). 

Basically, depending on whether one normalizes the 
faintest 6.7 \im counts according to the HDF-N counts (high; 
Oliver et al. 1997), or to the Lockman hole counts (low; 
Taniguchi et al. 1997), one gets almost a factor of 10 dif- 
ference in (Tconf at IRAC resolution. The deep ISOCAM ob- 
servations of Altieri et al. (1999), through a cluster- lens, 
support the high normalization. 

Nevertheless, a word of warning concerning the mid-IR 
ISOCAM counts is warranted. Some of the observations have 
been pushed quite deep, for example, beyond a 20 beams- 
per-source level. Our photometric simulations show that the 
accuracy of photometry is very low at these levels, and the 
trend is systematically towards higher fluxes. In addition to 
unstable photometry, astrometric errors become significant 
(Hogg 2000). Thus, until the counts are confirmed by better 
resolution (and hopefully larger sky-coverage), it remains a 
possibility, that the deepest mid-IR counts (both at 6.7 and 
15 fim) are significantly over-estimated. If that would be the 
case, the 8 \im baseline model would be more appropriate 
for estimating confusion levels than the ISO-fitted one. 



6 DISCUSSION 

6.1 Completeness of counts in IRAC bands 

Both completeness and reliability must be considered when 
detecting faint sources. Certain detection techniques might 
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Figure 8. Cumulative counts of the 3.6 \xm baseline model 
(dashed curve) with 'observed' counts from simulations (soUd 
lines). In aj the curves correspond to different convolutions: the 
top-most has a 1.5" FWHM Gaussian PSF, and the rest have 
PSF's larger in steps of 0.5" upto 8.5". The thick solid line cor- 
responds approximately to the actual IRAC beam size. In b) the 
extracted curves all use a realistic IRAC PSF [6 2.3") of this 
band. The topmost (thick) line has no noise added. The rest of 
the curves have increasing amount of random noise added to the 
image. See text for details. 



give high completeness rates while suffering from poor reli- 
ability. On the other hand, requiring high reliability means 
lower completeness at a given fiux level. We study both as- 
pects, along with the effect of source confusion, using sim- 
ulations of images which are realizations of model counts 
introduced earlier. 

An analogue of the aconf ( 



9q) calculation in Section 5.1 



to test how the extraction of sources from a simulated image 
changes with the convolving PSF. We constructed simulated 
images from the baseline 3.6 and 8 \im models - Figs. ^ 
and Ha show the 3.6 and 8 \i.m intrinsic cumulative model 
counts as dashed curves. Sources were drawn from N{S) and 
distributed randomly on an image with 1.2" pixels (fractions 
of pixels were allowed as the center of a source). The im- 
age size was 768 pixels^, corresponding to a 3x3 mosaic of 
IRAC frames. The image was then convolved with different 
Gaussian PSFs and sources were extracted using SExtrac- 
tor (Berlin & Arnouts 1996) and aperture magnitudes with 
approximately the size of the FWHM were calculated. The 
cumulative 'observed' counts are overplotted as solid lines. 

A comparison to confusion can be made by defining 70 
per cent, for example, as a satisfactory level of completeness 
and determining the flux density level where this complete- 
ness is reached. These were determined from each simulation 
and plotted in Figs. ^ and ^ as squares. The required 70 per 
cent completeness correspond to 3-6(Tconf confusion noise 
levels at FWHMs of several arc-seconds and up to lOaconf 
at smaller FWHMs. 

The dashed lines in Figs. §and0 show the limits where 
there are 20 beams per source. The completeness limit is 
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Figure 9. Same as previous figure, but for the baseline 8 \xm 
model: in aj the size of the PSF changes while \nb) the noise floor 
is different from image to image. If counts are very steep close to 
the detection limit, it becomes increasingly difficult to measure 
fluxes without overestimating the true flux. This shows itself as a 
bump in the extracted counts just before the completeness limit. 



close to this level. Confusion limits are often quoted as being 
at 30 - 40 beams/source, which is realistic when higher com- 
pleteness is required (or the definition of a beam is smaller, 
as eg. in Hogg 2000). Also, the example above showed a very 
idealized case regarding object detection - perfect Gaussian 
point sources, no additional noise or source clustering. We 
will return to the issue of detection below. 

Another way to derive confusion limits from simulations 
is to test how the extraction of sources changes as noise is 
added to an intrinsically noiseless image. The same simu- 
lated images as above were used, except that now the real 
IRAC PSFs for the two bands were used (FWHMs for both 
bands are « 2.3"). Fig. ^ shows the same intrinsic cumula- 
tive counts as earlier and the extracted counts correspond- 
ing to different amounts of additional noise. Starting from 
the bottom, the noise levels correspond approximately to 
the following integration times (IT, in seconds): 12, 24, 48, 
60, 90, 100, 200, 500, 1200, 3100, 7900, 19800, and 49600 
s. We have used the low background environment and the 
full-array readout mode, which has only IT's of 12, 30, 100 
and 200 s available, each of which has its own sensitivity 
(an IT of 60 seconds results in a different noise level de- 
pending on whether it is acquired with 5 coads of 12 s IT's 
or 2 readouts of 30 s IT's). The top-most observed curve is 
the truly confusion-limited case, where no noise is added. It 
is somewhat lower than the case with a Gaussian PSF of 
approximately the same size as used in Fig. ^ (thick line), 
showing the effect of the extended wings in the real PSF. 

The corresponding case for the 8 \im baseline model is 
shown in Fig. ^b. The corresponding integration times (in 
ksec) for the curves starting from bottom are: 0.1, 0.3, 0.7, 
1.8, 4.4, 10.9, 27.1, 67.0, 166, 411, 1020, and 2530 ksecs. The 
top-most curve has already merged with the no-noise curve 



- however, it has an IT of 700 hours, an amount only the 
largest Legacy Science projects have. 

6.2 Reliability of detections 

Naturally the extraction algorithm plays a role in construc- 
tion of the observed source counts and the exact determina- 
tion of completeness levels. For example, it is seen that the 
extracted counts show an excess compared to input sources 
in Figs. ^ and ^ Examination of input and output sources 
in our simulations showed that this bump is mostly caused 
by over-estimated fluxes of sources close to the detection 
limit. The over-estimation is due to a 'pedestal effect': unre- 
solved sources are entering the aperture where the source's 
magnitude is measured, and normal sky-subtraction rou- 
tines are biased because of the non-Gaussian noise com- 
ing from the positive-only signal of faint sources. We found 
this to be typical to all photometric systems based on aper- 
ture or isophotal photometry in very crowded fields. PSF- 
fitting techniques, or extremely small apertures with sub- 
sequent aperture corrections, give more accurate results in 
point source cases, such as the current test; however, these 
methods in turn have complications with real data where 
other shapes of objects are expected as well. The common 
Malmquist/Eddington-like bias is also partially responsible 
for the bump in the counts. It affects directly the numbers 
of sources counted: as there are more objects per flux (or 
magnitude) interval below the flux limit than above it, pho- 
tometric errors preferentially scatter more fainter sources to 
a brighter flux bin than vice versa. 

The depth reached can be optimized by tuning ap- 
propriate detection parameters in the selected extraction 
method. However, a detailed study of detection and pho- 
tometric techniques is out of the scope of this paper. To iso- 
late the completeness vs. confusion effects, we merely took 
care to be consistent from one test to another using small 
aperture magnitudes within SExtractor. 

It is worth pointing out, however, that regardless of the 
extraction algorithm used the source count slope and the 
bright sources have a significant and large effect on confusion 
and completeness. In the traditional parlance of Section ^ 
the high response tail of the R{x) distribution strongly af- 
fects the detection of sources. And it is the slope of the 
counts which determines R{x). To illustrate this point, we 
performed simulations with pure power-law source counts. 

Consider two source counts, showed in Fig. [lo|a. The dif- 
ferential counts have a power-law form N{S) oc S~~' , where 
the shallow curve has 7 = 1.5 while the steep one obeys 
Euclidean counts with 7 = 2.5. Both are adjusted to give 
counts similar to the ISO-fitted 8\im counts (dotted line) 
at ~ 10(i.Jy. These counts are urealistic, of course, but do 
illustrate clearly the dangers of adopting confusion limits 
without checking how they are derived. The squares show 
the observed counts from images made using the two galaxy 
counts. The characteristic bump of the observed counts is 
seen with the steep slope. A steady decrease of completeness 
is evident with the shallow counts: the 70 per cent com- 
pleteness limit is at 10ii.Jy while for the Euclidean counts 
the limit is closer to 5^.Jy. The reason for incompleteness 
is seen in panel b) where the cumulative surface density 
of sources is plotted. There are many bright objects in the 
shallow count; 50 beams/source is reached at 100 (xJy where 
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Figure 10. An illustration of effects of confusion noise and confusion limit using power-law source counts. Panel a) shows two source 
counts, Euclidean counts as the dashed line, and shallower counts as the solid line (the latter is multiplied by 10* for clarity). The dotted 
line shows the ISO-fitted 8 \xm counts for reference. Panel b) plots the cumulative surface density as beams/source; c) shows the pixel 
histrogram (or the total R{x) distribution). The inset has the same curves (number of pixels as a function of fiux) but plotted on linear 
axes to clarify the low response region; panel d) plots the confusion noise, c^onf '^s a function of beam FWHM. 



the completeness starts to drop. The 70 per cent complete- 
ness is at 17 beams/source. For the steep counts the drop 
also begins at « 50 beams/source, but is much faster, and 
by « 20 beams/source completeness correction of a factor 
of 5 would be needed. This is because we are approaching 
confusion noise instead of confusion limit 

Panel c) shows the pixel histograms of the images; these 
are equivalent to the total R{x) distribution (Eq. ^). The dis- 
tribution approaches a Gaussian as the slope steepens; with 
shallow slopes the histrogram approaches the N{S) distribu- 
tion (eg. Condon 1974). The width of the distribution is re- 
lated to the calculated confusion noise aconf . This is shown in 
panel d) as a function of FWHM. The (Jconf (^o) dependence 
of Eq. M is also verified. It is seen that the CTconf (6' ~ 2.3") 
from the Euclidean counts is a factor of 2 higher than from 
the shallow counts, in direct contrast with the expectation 
from source extraction simulation and surface density anal- 
ysis above. For the steeper counts one can assign e.g. 5 viJy 
as the confusion limit, which would be equivalent to 4.5aconf 
or 32 beams/source. The corresponding numbers at 70 per 



cent completeness (10^Jy) for shallow curve are IQaconf and 
17 beams/source. 

It thus turns out that the slope of the counts determines 
which sources dominate the confusion. If the source count 
slope is shallow, it is the bright sources which dominate, 
reducing the completeness of detection of an object at a 
given flux level. In this case the rule-of-thumb taken at ^ 
20 beams/source (lower for shallow counts and higher for 
steeper ones) gives a good idea of the practical confusion 
limit. 

If the slope is steep (7 > 2) the confusion noise due to 
fainter sources starts reducing the completeness faster than 
the effect of bright neighbouring sources (see also discussion 
in Helou & Beichman 1990). In this case the traditional way 
of calculating cTconf from Equation 5, and using ~ lOaconf 
for the confusion limit is more appropriate. 

6.3 Integration times for IRAC 

Figures |ll|, ^ and |l^ show the expected completeness lev- 
els using different integration times for observations with 
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Figure 11. Fractional completeness level as a function of the flux 
of the input source. The image in this Monte Carlo simulation is 
constructed from the baseline model at 3.6 ycca. The curves are 
labelled with the integration time in thousands of seconds. The 
thick solid line is the truly confusion limited case, with no noise 
added to the image, ie. infinite integration time. 



Figure 13. Same as previous, but using the ISO-fitted model 
to construct the image. Confusion determined from this plot is 
clearly higher than in the baseline model case. For example, ob- 
jects with intrinsic fluxes of 10 [ojy can never be detected with 90 
per cent completeness in ISO-fitted model, whereas, if the base- 
line model represents the universe, this could be done in about 4 
hours. 
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Figure 12. Same as previous, but for the 8 \xm band baseline 
model. 

IRAC's 3.6 and 8 \ixn filters with the realistic PSF's. Here we 
used a comprehensive Monte Carlo simulation: rather than 
extracting the source count once from an simulated image 
(as was done for Figs. ^ and test sources of a given flux 
were placed one-by-one on the simulated frames and then ex- 
tracted. In each flux bin for every simulation with different 
noise floor, the test was performed 2000 times. The integra- 
tion times corresponding to the different la noise floors are 
printed in ksecs. We show the baseline models and also the 
ISO-fltted model for the 8 yim band. 

A 'detection' can be defined in various ways. Here it was 
defined as finding a source (the closest one) within a FWHM 



radius and within a 4(t flux interval determined from the 
observed spread of fluxes at the given flux density. Making 
the selection in this way we, flrst of all, avoid a serious a 
priori knowledge bias which would be introduced if the ex- 
act same input source were searched for from the simulated 
image. Secondly, we chose the mentioned flux cut to be as 
little as possible dependent on the used photometric tech- 
nique. If, for instance, we had required a detection to be 
within ±30 per cent of the input flux (this is used in Hogg 
2000) there would have been more non-detections due to the 
typical brightening of sources when approaching the confu- 
sion limit. In this case the resulting no-noise completeness 
curves in Figs. pT| - would be lower by a factor of ~ 1.5 
at a few nJy level. 

Figures |ll| to ^ can be used to estimate the longest use- 
ful integration times for the IRAC instrument in deep extra- 
galactic surveys. To be totally confusion limited at 3.6 ^m, 
i.e. that the incompleteness would be due only to confusion, 
ITs of approximately 50 ksec (14 hours) would be needed. 
This is within the scope of a Legacy Science type project, 
where a 'noise map' of unresolved background sources from 
a reasonable size of area could be obtained for fluctuation 
studies. 

On the other hand, as far as direct source counts are 
concerned, much shorter ITs are adequate. Consider a 500 s 
exposure. At 6 \iJy, sources would still be detected to the 
confusion limit; that is sources would be lost at 20 per cent 
rate due to confusion only. Half of all 4 yijy objects would be 
undetected; ~ 50 per cent of these are lost due to confusion 
and the other half due to sky noise. This 4 ^.Jy level corre- 
sponds to approximately 15aconf as calculated from Eq. ^ 
Moreover, it is easy to see that the confusion limit is ap- 
proached; if the IT is increased by a factor of 10, sources 



© 2000 RAS, MNRAS 000, §-13 



Confusion limit due to galaxies: using SIRTF 's Infrared Array Camera 11 



Table 1. Confusion limits in \xjj for two IRAC bands using several models. Different 
values of confusion are given according to the method the limit is derived. In all 
the models in this table a value at ^ 20 beams/source, or ^ lOiTconf gives a good 
estimate {'adopted confusion limit') of the flux density level which could ultimately 
be reached with accurate enough (lOcr with ideal photometry) observations. However, 
for a practical limit, the required integration time (IT) would need to be considered 
as well. 



Method 


3.6 [om baseline 


8 \Ma baseline 


8 Mm ISO-fitted 


1-source/beam 


0.02 


0.07 


0.06 


20-source/beam 


2.0 


2.3 


4.2 


confusion noise (Jconf 


0.25 


0.40 


0.51 


Flux at 70 per cent compl. 


2.2 


2.3 


4.1 


Adopted confusion limit 


3 


3 


4 


Corresponding IT (ksec) 


3 


150 


50 



only 1.5 times fainter would be detected with the same con- 
fidence level. 

If only the 3.6 [im band were considered, we would con- 
clude that it does not make much sense to extend the in- 
tegration time beyond 1 hour. With this IT 3 (i.Jy sources 
are well detected. However, one is pushing confusion already 
since only ~ 60 per cent of all sources at that flux level could 
be extracted (it becomes unreliable to interpret and model 
the count slopes if completeness corrections exceed a factor 
of 2). This limit corresponds to approximately 25 beams per 
source. 

In the longest IRAC waveband, with the baseline model, 
integration times exceeding 10 hours would still uncover new 
sources at 3 — 4(i.Jy level with only modest completeness 
corrections of ~ 1.2. However, truly confusion limited images 
at 8 \im are much harder to reach than at 3.6 \im - they 
would require nearly 100 hours of integration time. 

A practical 8 vim confusion limit is reached at a brighter 
flux level if galaxy counts are better modelled with the ISO- 
fltted counts. Requiring 70 per cent completeness (with ~ 
10(7 ideal photometric detections), sources at 4 \ijy would 
be detected with the ISO-fltted model, while 2 fi.Jy objects 
could be reached with the baseline model. However, in the 
latter case the IT needed would exceed the scope of any 
realistic project. 

For the 8 [im band we conclude that an integration 
time of about 15 hours would be both sufficient and useful 
for performing a deep survey (at the same time this would 
allow truly confusion limited images from the two shortest 
wavelength filters). Sources at 4 \i3y would be at 8(Tconf or 
llcTconf depending whether galaxy counts are closer to the 
ISO-fitted or baseline model, respectively. Respective com- 
pleteness levels would be at « 60 and 80 per cent. The ISO 
and baseline models give 20 and 40 beams/source at 4 \iJy, 
respectively. 

Table 1 summarizes the integration times along with 
the expected values of confusion for the main models used 
in this work. It gives the confusion limit and noise in the 
several different methods presented in this work. Specifi- 
cally, it can be seen that the one-beam-per-source limit is 
nearly two orders of magnitude too low. Because the faint 
mid-IR counts are flatter in the 3.6ii-m band, the overall con- 
fusion limit is best described by approximately 15 (Tconf- In 
the longer IRAC waveband the limit is ^ 10. The steepest 
counts are the baseline 8yim - the resulting adopted confu- 
sion is approximately 25 beams/source, while for the other 



two cases in the table confusion could be described as ^ 20 
beams per source. 

There are indications that the IRAC resolution might 
prove to be better than the instrument speciflcations used 
here (2.3"). If this will be the case, confusion estimates 
change somewhat, especially for the two shortest wavelength 
bands. If the 3.6 yuoa band would achieve an in-flight PSF of 
PS 1", the (Tconf value would decrease by a factor of 7 (taking 
into account the 1.2"pixel size). A limit of 25 beams/source 
would be at 0.5 \i3y. It would therefore become difficult 
to reach confusion limited images in this band. The 8 [i 
band estimates would not change significantly from the val- 
ues presented above. The possibly improving resolution will 
be counteracted by real images, which are not so ideal as the 
point-source simulations performed here but will be spatially 
resolved galaxies. 

Finally, note that if 90 per cent completeness were de- 
sired, confusion limits are more appropriately at ~ 10(i.Jy, 
or at around 100 beams/source. If 80 per cent completeness 
is enough, IRAC confusion limits are at ~ ()\^3y■ 

6.4 Detection of high-2: objects 

The study of high-redshift universe with SIRTF is highly 
dependent on obtaining accurate enough photometric red- 
shifts of distant galaxies. This can be performed by tracking 
the signature of the H~ opacity feature in galaxy spectra 
as it passes through the IRAC filters (Wright, Eisenhardt 
& Fazio 1994). As far as IRAC sensitivities are concerned, 
Simpson & Eisenhardt (1999) show that a realistic goal for 
the deep galaxy surveys is to determine these redshifts ac- 
curately for L* galaxies up to 2 ~ 4. However, they did 
not consider confusion in their calculations, though they 
refer to the IcTconf ~ 0.5viJy expected from Franceschini 
et al. (1991). The results thus seem optimistic compared 
to work here. Flux densities of 1-2 fi.Jy seem impossible to 
reach. Certainly many individual sources at this level will 
be extracted, but they come from deep within the confused 
regime, so that obtaining global properties would be highly 
uncertain due to large completeness and photometric cor- 
rections. 

According to our confusion estimates, and using model 
spectra of an evolved L* galaxy (Fazio et al. 1998), these 
galaxies can be detected out to 2: « 3.5 using the base- 
line model case. This is obtained by setting 3 piJy as the 
lowest flux density level where object photometry can be 
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determined with enough accuracy (we assume lOo" in ideal 
case, which still leaves room for source extraction in prac- 
tice, when objects are not point sources). In the ISO-fitted 
case z ~ 3 could still be reached. 

The situation changes if the highest amplitude ISO- 
CAM galaxy counts (HDF-N counts of Oliver et al. 1997) 
prove to be the most accurate counts. If the bright bump 
-model (higher dashed line in Figs. I audi) were used, the 
8 \i.m band would already become practically confusion lim- 
ited at 10 ii.Jy. This would limit high-z observation to z ~ 2. 

It is just these faint IR-galaxies with unknown popula- 
tion characterisctics that SIRTF projects will be studying. 
If the deep surveys detect high counts and surface densities, 
as would be expected from ISOCAM results, it would imme- 
diately imply strong evolution (e.g. Elbaz et al. 1998), and 
possibly hitherto unknown evolutionary stages of infrared- 
bright objects. On the other hand, that would also necessar- 
ily mean that the IRAC instrument is becoming confusion 
limited sooner than expected, and the highest-z studies of 
galaxies may suffer. Surveys reaching galaxies at z > 3 are 
possible only if the mid-IR counts are at a moderate level 
and/or the in-flight IRAC PSF performance will be at the 
most optimistic values in all IRAC filters. 



7 SUMMARY 

We have presented empirically based estimates for confusion 
due to galaxies in mid infrared wavelengths. For SIRTFs 
IRAC bands the confusion limits are at 3 - 4 ^.Jy, if com- 
pleteness corrections by a factor of 1.3 to intrinsic counts 
are allowed. The la confusion noise ranges from 0.2 to 0.5 
|j.Jy moving from the 3.6 |i.m waveband to the 8 [im. If the 
highest amplitude KO-based mid-IR galaxy counts will be 
confirmed, the confusion limit is higher by a factor of two. 

We have shown the differences between commonly used 
confusion definitions. Specifically we separated confusion 
noise from confusion limit, so that the first relates to the 
fluctuations from unresolved sources and the latter is defined 
by source surface density arguments or derived by complete- 
ness analysis of source detections. The slope of counts de- 
termines which confusion determination is more appropriate 
to use. With steep counts it is the faint source fluctuations 
which dominate, and when the count slope is shallow bright 
sources decrease completeness efficiently and thus determine 
the confusion limit. The combination of IRAC resolution and 
typical mid-IR source counts is an intermediate case, where 
the overall confusion limit (~ 4n.Jy) can be found at 10-20 
times the confusion noise cTconf- 

We conclude that to make a useful high-redshift {z ^ 3) 
deep survey with 5'/i?T'i<'/IRAC, integration times of at least 
15 hours should be used. Such a survey would be confusion 
limited at 3.6 and 4.5 yim, if the original IRAC design re- 
quirement resolution is used. If sharper images are obtained, 
3.6 ^m images will become confusion limited at similar inte- 
gration times as the 8 yim channel, i.e. only after 100 hours. 



REFERENCES 



J.L., 1999, A&A, 342, 



Altieri B., et al., 1999, A&A , 343, L65 
Aussel A., Cesarsky C.J., Elbaz D., Starck, 
313 

Barcons X., 1992, ApJ, 396, 460 

Berlin E., Arnouts S., 1996, A&AS, 117, 393 

Blain A.W., Ivison R.J., Smail I., 1998, MNRAS, 296, L29 

Bruzual A.G., Chariot S., 1993 ApJ, 405, 538 

Condon J. J., 1974, ApJ, 188, 279 

Elbaz D. et al., 1999, A&A, 351, L37 

Fanson J., Fazio C, Houck J., Kelly T., Rieke C, Tenerelli D., 
Whitten M., 1998, in Bely P.Y., Breckinridge J.B., eds, Proc. 
SPIE 3356, Space Telescopes and Instruments, p. 478 

Fazio G.G. et al., 1998, in Fowler A., ed, Proc. SPIE 3354, Infrared 
Astronomical Instrumentation, p. 1024 

Flores H., et al., 1999, A&A, 343, 389 

Franceschini A., Toffolatti A., Danese L., De Zotti G., 1989, ApJ, 
344, 35 

Franceschini A., Toffolatti A., Mazzei P., Danese L., De Zotti G., 

1991, A&AS, 89, 285 
Franceschini A., et al., 1997, in Wilson E., ed., Proc. ESA symp., 

The far infrared and submillimeter universe, ESA SP-401, p. 

159 

Gardner J. P., 1998, PASP, 110, 291 

Gardner J. P., Sharpies R.M., Frenk C.S., Carrasco B.E., 1997, 

ApJ, 480, L99 
Genzel R., Cesarsky, C.J., 2000, ARA&A, 38, 761 
Guiderdoni B., Hivon E., Bouchct F.R., Maffei B., 1998, MNRAS, 

295, 877 

Hacking P., Houck J.R., 1987, ApJS, 63, 311 

Helou G., Beichman, C.A., 1990, in Kaldeich, B., ed.. From 
Ground-Based to Space-Borne Sub-mm Astronomy, Liege 
Int. Astroph. Coll., Paris, ESA, p. 117 

Hogg D.W., 2001, AJ, 121, 1207 

Mattila K., Lehtinen K., Lemke D., 1999, A&A, 342, 643 
Oliver S., et al., 1997, MNRAS, 289, 471 
Oliver S., et al., 2000, MNRAS, 316, 749 
Pearson CP., Rowan-Robinson M., 1996, MNRAS, 283, 174 
Pearson CP., Matsuhara H., O naka T.. Watarai H .. Matsumoto 



T., 2000, MNRAS, preprint, |astro-ph/0008472 
Roche N., Eales S.A., 1999, MNRAS, 307, 111 
Scheuer P.A.G., 1957, Proc. Cambridge Phil. Soc, 53, 764 
Scheuer P.A.G., 1974, MNRAS, 166, 329 
Serjeant S., et al., 2000, MNRAS, 316, 768 

Silva L., Granato G., Bressan A., Danese L., 1998, ApJ, 509, 103 

Simpson C, Eisenhardt P., 1999, PASP, 111, 691 

Szokoly CP., Subbarao M.U., Connolly A.J., Mobasher B., 1998, 

ApJ, 492, 452 
Taniguchi Y., et al., 1997, A&A, 328, L9 

Toffolatti L., Argiieso Gomez F., de Zotti C, Mazzei P., Frances- 
Danese L., Burigana C, 1998, MNRAS, 297, 117 
1996, A&A, 315, 21 

Tollestrup E.V., Willner S.P., Cohen M., 2000, ApJ, 



Pauliny-Toth I.I.K., Witzel A., 1982, 



chini A., 
Vaisanen P., 
Vaisanen P., 
540, 593 
Wall J. v., Scheuer P.A.G. 

MNRAS, 198, 221 
Wright E.L., Eisenhardt P., Fazio C, 1994, AAS, 26, 893 
Xu C, et al., 1998, ApJ, 508, 576 
Yoshii Y., Takahara F., 1988, ApJ, 326, 1 

This paper has been produced using the Royal Astronomical 
Society/Blackwell Science MJ^X style file. 



ACKNOWLEDGEMENTS 

P.V. wishes to thank the Finnish Academy and the Smith- 
sonian Institution for support during this research. 

© 2000 RAS, MNRAS 000, §-13 



